Splicing Factor 
The invention relates to proteins comprising splicing 
factor activity in plants. 

Alternative pre-mRNA-splicing is part of the 
5 expression program of a large number of genes in animals 
and plants. It allows the selection of different 
combinations of splice sites within a given pre-mRNA, 
generating structurally and functionally distinct protein 
isoforms (Breitbart et al . 1987; Manley and Tacke 1996; 

10 Caceres and Krainer, 1997). Several protein factors 

involved in the regulation of alternative splicing have 
been described, including a family of RNA-binding proteins 
containing arginine/serine-rich regions (SR proteins) (for 
reviews, see Fu 1995; Chabot 1996; Valcarcel and Green 

15 1996; Caceres and Krainer 1997) . SR proteins are highly 
conserved nuclear phosphoproteins that are members of a 
protein family and share a serine phospho-epitope 
recognized by the monoclonal antibody mAbl04 (Roth et al. 
1991) . They consist at least of one RNA-binding domain 

20 (RBD) [the typical RBD or RNA recognition motif (RRM) 
domain of approximately 8 0 amino aicds] (Birney et al . 
1993), and possess several serine/arginine (SR) dipeptides 
near their carboxy termini (Zahler et al. 1992). In 
general, SR proteins are a defined subgroup of a large 

25 superfamily of nuclear proteins with RS-rich domains of 
variable sequence and position (Fu 1995) . 

The human SF2/ASF splicing factor, a prototype of an 
SR protein, is essential for the first cleavage step in 
pre-mRNA splicing (Krainer et al. 1990b; Ge et al. 1991) 

30 and can also determine in a concentration-dependent manner 
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which 5' splice site is selected in pre-mRNAs containing 
alternative sites (Ge and Manley 1990; Krainer et al . 
1990a). The preferential usage of the proximal 5' splice 
site at higher concentrations of SF2/ASF is counteracted by 
5 members of the hnRNP A/B family of proteins (Mayeda and 
Krainer 1992; Mayeda et al. 1994), suggesting that the 
relative abundance of these and possible other antagonistic 
splicing factors determines the patterns of alternative 
splicing in vitro and in vivo (Mayeda and Krainer 1992; 

10 Caceres et al. 1994; Yang et al. 1994; Wang and Manley 
1995; Hanamura et al. 1998) . SF2/ASF and any other SR 
protein can complement splicing-def icient S100 extracts 
that essentially lack SR proteins, but individual SR 
proteins sometimes display distinct specificities and 

15 efficiencies in splicing different pre-mRNAs (Krainer et 

al. 1990b; Ge et al . 1991; Fu and Maniatis 1992; Kim et al. 
1992; Mayeda et al . 1992; Fu 1993; Zahler et al. 1993; 
Screaton et al . 1995; Wang and Manley 1995). Furthermore, 
an important activity of SF2/ASF and other SR proteins is 

20 the interaction with exonic enhancer sequences, which 

stimulate usage of an adjacent weak splice site (Sun et al. 
1993; Tian and Maniatis 1993; Staknis and Reed 1994; 
Ramchatesingh et al. 1995; Tacke and Manley 1995; Lou et 
al. 1996) . The RNA-binding specificity of an SR protein is 

25 conferred by the RRM region and adjacent sequences (Caceres 
and Krainer 1993; Zuo and Manley 1993; Tacke and Manley 
1995; Allain and Howe 1997; Tacke et al. 1997); the various 
RS domains are responsible primarily for protein-protein 
interactions, which are modulated by the phosphorylation 

30 status of these regions (Wu and Maniatis 1993; Amrein et 
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al. 1994; Kohtz et al. 1994; Zuo and Maniatis 1996; Xiao 
and Manley 1997) . In addition, it has been demonstrated 
that RS domains modulate the RNA-binding activity and 
subnuclear localization of SR proteins (Li and Bingham 
1991; Hedley et al. 1995; Caceres et al. 1997). Recently, 
SF2/ASF, SRp20, and 9G8 have been shown to shuttle between 
the nucleus and the cytoplasm, and this property depends on 
the presence and type of RS domain (Caceres et al. 1998). 

The studies described above led to the proposal that 
SR proteins might function by bridging splice sites through 
RNA-protein and protein-protein interactions, therefore 
establishing early interactions for splice-site definition 
and for the assembly of spliceosomes . These interactions 
are stimulated by binding of SR proteins to nearby enhancer 
seguences and modulated by 

phosphorylation/dephosphorylation of the RS regions. 

There is limited information about the regulation of 
SR protein expression in vivo. In the case of SF2/ASF and 
SRp20, significant differences in mRNA and protein levels 
have been observed in various cell types and tissues (Jumaa 
et al. 1997; Hanamura et al. 1998). SC35 expression is also 
highly variable in cell lines (Fu and Maniatis 1992; 
Vellard et al. 1992) and some SR proteins are activated by 
mitogens (Diamond et al . 1993; Screaton et al . 1995). 
Interestingly, several alternatively spliced SR protein 
mRNAs have been described, which code for truncated 
proteins of still unknown function (Ge et al. 1991; Cavaloc 
et al. 1994; Screaton et al. 1995; Jumaa et al . 1997). 
Human SRp20 autoregulates its expression at the level of 
splicing by binding to its own pre-mRNA, thereby preventing 
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overexpression of the active protein (Jumaa and Nielsen 
1997) . it was found recently that the amounts of 
alternatively spliced mRNAs coding for SRp30b and SRp20 in 
Caenorhabditis elegans may be regulated at least in part at 
the level of mRNA stability (Morrison et al. 1997) . 

Hitherto, in organisms comprising a protein 
corresponding to the human SF2/ASF protein, merely one 
variant each of this protein has been found. 

Little is known about the mechanisms of intron 
excision in plant cells. It is generally assumed that the 
basic mechanism of splicing in plants is similar to that of 
yeast and mammals (Solymosy and Pollak 1993; Luehrsen et 
al. 1994; Filipowicz et al . 1995; Brown and Simpson 1998). 
Nevertheless, animal introns are not processed in any plant 
tissue so far examined (Barta et al. 1986; Van Santen and 
Spritz 1987; Wiebauer et al. 1988). Apparently, the process 
of intron recognition differs in these two kingdoms. One of 
the determining features of introns in plants seems to be a 
U or AU-rich character, whereas the exons are more GC-rich 
(for review, see Brown and Simpson 1998) . As SR-proteins 
play a critical role in correct splice-site selection in 
mammals, it is of interest to characterized the 
corresponding protein family in plants. When screened for 
similar proteins in plants by using the mAbl04 antibody or 
a specific monoclonal antibody raised against human 
SF2/ASF, it could be demonstrated that plants do possess SR 
proteins, including SF2/ASF-like proteins 

(Lopato et al. 1996a) . However, the plant SR proteins are 
of different size and are smaller than 55 kD. Furthermore, 
it could be shown that plant SR proteins are active in 
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constitutive and alternative splicing when assayed in 
heterologous HeLa cell extracts. 

In an effort to isolate individual plant splicing 
factors, arginine/serine-rich proteins from Arabidopsis 
5 belonging to two different families have been characterized 
(Lopato et al . 1996b; 1999). Both families show good 
homology in the amino-terminal RRM with animal SR proteins, 
but at their carboxy-terminal domains they are richer in 
arginine than in serine, have few SR dipeptides, and they 
10 were therefore termed RS proteins. Nevertheless, these 
proteins are recognized by the mAbl04 antibody and can 
complement SR protein-deficient HeLa S100 extracts 
efficiently. 

In Arabidopsis thaliana, a protein with similarity to 
15 human SF2/ASF protein could be identified (Lazar et al . 
1995) which is known under the name of SRI, atSRp34 or 
atSRp34/SRl. 

Influencing the splicing process in plants is one of 
the possible ways of arriving at improved plants. 
20 Therefore, it is an object of the present invention to 

provide new plant splicing factors which can be used within 
the scope of biotechnology. 

According to the invention, this object is achieved by 
a protein having splicing factor activity in plants, which 
25 either 

©comprises the amino acid sequence of the protein according 

to Fig. 1A of the appendix, or 
©comprises the sequence of the amino acids 1 to 4, 7 to 19, 

45 to 52, 111 to 116, and 149 to 153 of the protein 
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according to Fig. 1A of the appendix and has more than 
85% similarity with this protein, or 
©comprises more than 60% similarity with the splice 

proteins atSRp34/SRl and SF2/ASF according to Fig. 2 of 
5 the appendix, wherein the G-rich sequence, which 
corresponds to the amino acids 85 to 113 of the 
atSRp34/SRl protein is substituted by an S-rich sequence, 
or 

©corresponds to, or is derived from, the protein 

10 corresponding to Fig. 1A from a plant other than 
Arabidopsis thaliana. 

The protein according to the invention advantageously 
is characterized in that it comprises the sequence of the 
amino acids 1 to 4, 7 to 19, 22 to 72, 74 to 85, 96 to 141, 

15 149 to 153, 156 to 172, with amino acid 168 being variable, 
yet not D or N, of the protein according to Fig. 1A of the 
appendix. The highly conserved sequences indicated 
preferably are arranged such that the distances between the 
sequences indicated approximately correspond to those 

20 according to Fig. 1A of the appendix, preferably not more 
than 5 amino acid deletions or amino acid insertions being 
provided. Particularly preferred are sequences in which 
either no, or merely one single, deletion or insertion is 
present between the conserved sequences. 

25 The protein according to the invention may comprise 

one or more amino acid substitutions as compared to the 
protein illustrated in Fig. 1A of the appendix, as long as 
the sequence of the amino acids 1 to 4, 7 to 19, 45 to 52, 
111 to 116, and 149 to 153 of the protein according to Fig. 

30 1A of the appendix or the function of the protein as 
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splicing factor are not affected. Its function as a 
splicing factor is to be considered to be affected if the 
protein comprises, e.g., only 10 to 20% of the natural 
activity of the protein according to Fig. 1A of the 
appendix. 

Amino acid substitutions may particularly be provided 
in the region of the amino acids of Fig. 2 of the appendix 
not marked as conserved, in particular if they do not 
essentially influence the three-dimensional structure or 
the charge distribution of the protein. In these regions, 
e.g., an interchange of hydrophobic amino acids or an 
interchange of aromatic amino acids is possible, unless the 
natural activity is lost thereby. 

According to the invention, all proteins are preferred 
which comprise an amino acid sequence having at least 90% 
identity with the sequence of the amino acids 1 to 85 and 
96 to 222 of the protein according to Fig. 1A of the 
appendix. 

Proteins which comprise an amino acid sequence that 
has at least 95%, in particular at least 98%, identity with 
the sequence of the amino acids 1 to 85 and 96 to 222 of 
the protein according to Fig. 1A of the appendix are 
particularly preferred. 

With the proteins of the invention it has been 
possible for the first time to find a second SF2 /ASF- 
similar protein in an organism (adjacent atSRp34/SRl) , 
resulting in the possibility of a combined influencing of 
the splice process. In the appendix it is shown that the 
two proteins mutually influence the splicing process but 
also have an effect on other proteins which, moreover, with 
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the appropriate genetic control, will result in surprising 
possibilities of changing the splicing properties of the 
plant or of the plant cell. 

The S-rich sequence between amino acid 95 and 102 of 
5 the sequence according to Fig. 1A of the appendix 

preferably comprises at least 4, in particular at least 7, 
serine residues. Advantageously, it is G-free, in 
particular free from GGR or GGR repeats. 

The protein according to the invention can be provided 
10 by standard methods of molecular biology also from other 

plants, because of the high homology of such proteins, such 
as by PCR coupled cloning methods or by other screening of 
gene libraries, e.g. with the sequence according to Fig. 1A 
of the appendix. 
15 According to a further aspect, the present invention 

relates to nucleic acid molecules, which either 
©comprise a nucleic acid sequence according to Fig. 1A of 

the appendix, or 
©comprise a nucleic acid sequence encoding a protein of the 
20 invention, or 

©comprise a nucleic acid sequence which, under stringent 
conditions, binds to the nucleic acid molecule according 
to Fig. 1A and encodes a splice protein active in plants 
or is complementary thereto. 
25 By stringent conditions, e.g., hybridizing in 7% SDS, 

0.5 M NaP0 4 , pH 7.0, 1 rtiM EDTA at 50 °C followed by washing 
with 3% SDS is to be understood. 

These molecules may either be DNA or also RNA 
molecules. In the case of RNA molecules, the alternative 
30 spliced forms, in particular those described in the 
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appendix, are preferred embodiments of the present 
invention . 

Another object of the invention is a biologically 
functional vector which is characterized in that it 
comprises a nucleic acid molecule according to the 
invention . 

In a further aspect, the invention moreover relates to 
a system comprising a nucleic acid which encodes a protein 
according to any one of the claims, and a nucleic acid 
which encodes 

©the atSRp34/SRl protein of Arabidopsis thaliana or 
©the protein corresponding to Fig. 1A from a plant other 

than Arabidopsis thaliana, or 
©a protein derived from these proteins, 

wherein at least one of the nucleic acids is under the 
control of a promoter not naturally connected with these 
nucleic acids. 

By atSRp34/SRl-derived proteins also those proteins 
are to be understood which define the proteins according to 
the invention, starting from the protein illustrated in 
Fig. 1A, in analogous manner. 

Preferably, both nucleic acids are controlled by 
promoters which are not naturally connected with these 
nucleic acids. 

Preferably, at least one of the promoters under whose 
control the nucleic acids are is an inducible promoter. 

According to a preferred embodiment of the present 
invention, the nucleic acid encoding a protein according to 
the invention is under the control of a promoter which 
causes an overexpression of this protein. 
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Preferably, the nucleic acid which encodes a protein 
according to the invention is under the control of a 
promoter which, under defined conditions, prevents the 
expression of this protein and, under defined different 
conditions, enables the expression of this protein. 

According to a further aspect, the present invention 
relates to transgenic plants or plant cells which express a 
protein according to the invention and comprise a nucleic 
acid molecule, a vector or a system according to the 
invention. 

Moreover, the present invention relates to the use of 
a protein according to the invention, a nucleic acid 
according to the invention, a vector according to the 
invention or a system according to the invention for 
changing the splice properties of a plant cell or of a 
plant . 

Moreover, the subjects of the invention may be used 
for changing the development behavior of a plant. 

A particular aspect lies in the use for retarding the 
flowering stage of plants. 

According to the invention, it has been shown that by 
such a use, a retardation of the flower formation of a 
plant of at least 15%, preferably at least 25%, as compared 
to its wild-type can be induced, e.g. if the protein 
according to the invention is put under the control of a 
strong promoter. 

Particularly preferred plants within the scope of the 
present invention are economic crop plants, such as 
cereal, beans, rice or fruit plants. 
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The invention will now be explained in more detail by 
way of the following examples as well as the drawing 
figures, yet it shall not be restricted thereto. 

In the Figures, 

Fig. 1A shows a genomic sequence of atSRp30; promoter 
and intron sequences are indicated in lower-case, cDNA 
sequences in upper-case, and the TATA box in bold italics. 
The derived protein sequence is shown below the DNA in the 
one-letter code. The bold sequence in the tenth intron is 
contained in alternative spliced mRNAs, and the underlined 
sequence is included if both cryptic 3' and 5' splice sites 
are used. The conserved RNP submotifs of both RRMs are 
boxed. The open arrows mark the ends of promoter sequences 
used for expression studies. Solid horizontal arrows 
indicate the ends of the PCR product obtained with 
degenerate primers and used as a probe for library 
screening. (I) The end of the GatSRp30 clone. The sequence 
data of the atSRp30 gene was submitted to the EMBL database 
(accession number AJ131214) . 

Fig. IB shows a schematic representation of GatSRp30, 
its mRNA isoforms, and derived proteins. Exons are shown as 
boxes and introns as lines (bold lines: introns included in 
the alternatively spliced mRNAs). Exonic 5' and 3' 
untranslated regions are shaded, and the coding regions are 
black. (*) The new stop codon in the alternatively spliced 
products . 

Fig. 1C shows a schematic representation of GatSRp34, 
its mRNA isoforms, and derived proteins. The drawings are 
as in B (accession number AF001035) . 
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Fig. ID shows a partial genomic sequence of 
atSRp34/SRl starting from exon 10 up to the stop codon. The 
bold sequence in the long intron is included in 
alternatively spliced mRNAs, the underlined sequence is 
5 included when both cryptic 3' and 5' splice sites are used. 
Fig. 2 shows the alignment of Arabidopsis atSRp30, 
atSRp34/SRl, and human SF2/ASF protein sequences (solid 
area) . Positions at which a single residue occurs in at 
least two of the sequences (shaded areas) . Conservative 

10 substitutions such as RK, IVL, ED, FY, ST. The positions of 
the conserved RNP-1 and RNP-2 submotifs and the glycine- 
rich region are indicated. 

Fig. 3 shows the expression of atSRp30 and atSRp34/SRl 
in wild-type Arabidopsis thaliana. (A) Expression in 

15 different plant tissues. Northern blot analysis of poly (A) + 
RNA from leaves (L) , stems (S) , roots (R) , and flowers (F) 
is shown. One microgram of RNA was loaded per lane, and the 
blots were either probed with a probe corresponding to the 
tenth intron of GatSRp30, or with the cDNAs of atSRp30 or 

20 atSRp34/SRl. (B, C) Developmental expression of atSRp30. 

Total RNA was isolated from whole plants on different days 
during development starting from the day of germination. 
The RNA was either used for Northern blot analysis with 
10 pg/lane of total RNA and the membrane probed with 

25 atSRp30 cDNA (B) , or for analysis of the RT-PCR products on 
a 1.2% agarose gel (C) . The two primers for the PCR 
reaction were located in the exons adjacent to the tenth 
intron . 

Fig. 4 shows the immunodetection of phosphorylated and 
30 dephosphorylated SR proteins from cell cultures of 
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Arabidopsis . (A) Immunodetection of proteins in SR protein 
preparations (lanes 1-4) . (Lane 1) Monoclonal antibody 
mAbl04, which is specific for a common phosphoepitope of SR 
proteins; (lane 2) polyclonal antibody raised against 
recombinant atSRp30; (lane 3) polyclonal antibody raised 
against recombinant atSRp34/SRl; (lane 4) monoclonal 
antibody specific for human SF2/ASF. Protein markers are 
indicated at the left. (B, C) Dephosphorylation of SR 
proteins with alkaline phosphatase. (Lanes 1, 2) SR 
proteins incubated at 37 °C without enzyme for 0 and 24 h, 
respectively; (lanes 3, 4) SR proteins incubated with 0.2 
and 1.0 U/ul of enzyme for 3 h; (lanes 5, 6) SR proteins 
incubated with 1.0 and 2.0 U/ul of enzyme for 24 h. 
Proteins were probed either with anti-p30 (B) or anti-P34 
(C) . 

Fig. 5 shows the transcriptional analysis of atSRp30 
(A-D) and atSRp34/SRl (E-H) promoter-GUS fusions in 
transgenic Arabidopsis. GUS staining of (A, E) flowers at 
the postanthesis stage; (B, F) staining patterns in the 
leaves; (C, G) primary and developing lateral roots; (D, H) 
seedlings (2 days after germination) . 

Fig. 6 shows the overexpression of atSRp30 in 
transgenic Arabidopsis plants. The cDNA and the genomic 
sequences of atSRp30 were cloned under the control of the 
strong 35S CaMV promoter (constructs pC30 and pG30, 
respectively) and used to transform Arabidopsis plants. (A) 
Northern blot analysis of total RNA isolated from 
independent transgenic lines probed with atSRp30 cDNA. 
(Lanes 1-4) Plants transformed with the pG30 construct; 
(lanes 5-9) plants transformed with pC30; (lane 10) control 
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plant transformed with pBI121 (35S CaMV-GUS construct) . 
mRNA isoforms are indicated. (B) RT-PCR analysis of the 
same transgenic lines. The primers used are indicated with 
arrows in the diagram of GatSRp30. (Lane 10) Four times 
5 more material was loaded in this control lane. (C) 

Immunodetection of atSRp30 in total protein extracts of 
overexpressing transgenic plants. (Lane 1) SR protein 
preparations from Arabidopsis cell culture (wild type) ; 
(lane 2) line transformed with pG30; (lanes 3-7) lines 

10 transformed with pC30; (lane 8) pC30 transgenic line that 
does not overexpress mRNAl . Antibody raised against atSRp30 
(anti-p30) was used for immunodetection. 

Fig. 7 shows phenotypic changes in plants 
overexpressing atSRp30. (A) Plants overexpressing atSRp30; 

15 (B) wild-type plants; (C) plants overexpressing atSRp30 
grown under short day conditions displays no apical 
dominance, resulting in a bushy phenotype . 

Fig. 8 shows the regulation of the alternative 
splicing pattern of atSRp31 and U1-70K in plants 

20 overexpressing atSRp30. The relevant alternative splicing 
events are shown in the diagrams at the top. RT-PCR was 
carried out using primers (marked with arrows) derived from 
the adjacent exons . (A) Changes in splicing in the second 
intron of GatRSp31. (Lane 1) Control line transformed with 

25 pBI121 (35SCaMV-GUS construct), (lanes 2-5) lines 

transformed with the pG30 construct; (lanes 6-10) lines 
transformed with pC30. (B) Correctly spliced mRNA encoding 
U1-70K protein is produced upon overexpression of atSRp30. 
(Lane 1) control plant transformed with pBI121 (35SCaMV-GUS 

30 construct), (lanes 2, 3) lines transformed with the pG30 
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construct; (lanes 4-8) plants transformed with pC30; (lane 
9) control pC30 transgenic plant that does not overexpress 
mRNAl; (lane 10) 19-day old wild-type plant. 

Fig. 9 shows changes in the expression pattern of 
5 atSRp34/SRl in plants overexpressing atSRp30. (A) 

Preferential usage of an alternative 5' splice site in the 
eleventh intron of atSRp34/SRl. The gene, the relevant 
alternative splicing events, and the derived proteins are 
depicted at the top, the gel showing the RT-PCR products 

10 obtained with primers from the adjacent exons . (Lane 1) 
Control, untransf ormed wild-type plant; (lanes 2 to 5) 
lines transformed with the pG30 construct; (lanes 6 to 11) 
plants transformed with pC30; (lane 11) control pC30 
transgenic plant that does not overexpress mRNAl. (B) 

15 Immunodetection of atSRp34 and atSRp34s in total protein 
extracts from transgenic plants. (Lanes 1, 2) SR protein 
preparations isolated from Arabidopsis plants and cell 
culture, respectively; (lane 3) control plant transformed 
with pC30 that does not overexpress mRNAl; (lane 4) line 

20 transformed with pC30; (lane 5) line transformed with the 
pG30 construct. Anti-p34 antibodies were used for 
immunodectection. 
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Examples : 

MATERIALS AND METHODS 

Purification of SR-proteins and protein sequencing 
The SR protein purification has been described 
5 previously (Lopato et al . 1996a). The proteins were 

separated by SDS-PAGE. After Coomassie G staining, the 
protein bands were excised and subjected to in-gel 
digestion with Lys-C endopeptidase . The resulting peptides 
were separated by HPLC and sequenced by automated Edman 

10 degradation as described previously (Wang et al. 1996) . 

Isolation and sequencing of genomic DNA and cDNA of 
atSRp30 and atSRp34/SRl 

Total DNA of carrot, tobacco, and Arabidopsis was 
isolated by the cetyltrimethylammonium bromide (CTAB) 

15 method (Murray and Thompson 1980) . The most conserved part 
of the sequenced peptide YVGNLPGDI was used to design two 
degenerate forward primers. Two reverse degenerate primers 
were derived from the peptide sequence SWQDLKDHM, wich was 
also part of a sequenced peptide. All four combinations of 

20 degenerate primers were used in PCR reactions with genomic 
DNA from different plants as a template. PCR products were 
subcloned and sequenced. The PCR product from Arabidopsis 
was used as a probe for screening a genomic A.ZAPII library 
of a. thaliana var . Columbia ( Stratagene) . One positive 

25 clone, GatSRp30, was found, mapped, subcloned, and 

sequenced (EMBL accession number AJ131214) . A partial cDNA 
sequence was found by a BLAST search through the EST 
databases with the accession number R65514, 4018 
Arabidopsis thaliana cDNA clone 17J1T7, and was kindly 

30 provided by the Arabidopsis Biological Resource Center at 
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Ohio State University. Sequence homology was analyzed using 
Genetics Computer Group (GCG, University of Madison, 
Madison, WI) sequence analysis software package version 
7.1-UNIX. Sequencing of the cDNA and the genomic clones was 
5 done using an A.L.F. DNA Sequencer and Auto Read Sequencing 
kit (Pharmacia) . To obtain a cDNA clone of atSRp34/SRl, the 
EST clone with the accession number ACT76795, 11573 atcDNA 
clone 151I9T7, was sequenced. The genomic sequences for 
atSRp34/SRl (accession number AF001035) were isolated by 

10 PCR and sequenced. 

Analysis of alternatively spliced isoforms 
RNA blot analyses were done as described (Lopato et 
al . 1996b). cDNAs encoding alternatively spliced isoforms 
were obtained by RT-PCR using total RNA preparations from 

15 A. thaliana plants at different stages of development and 
primers derived from the 3' untranslated region (for 
reverse transcription) . 



(atSRp34/SRl) 

(The underlined nucleotides correspond to restriction 
sites) and from the beginning of the 5' untranslated region 
at the end of coding region for (PCR) : 



( 1 ) 5 ' -AAAT GAGCTC AAATGTATATGTATGGAAAAACC-3 ' 



(atSRp30) and 



20 



( 2 ) 5 ' - AAT GAG C T C G AAAC GAT AT C T T C AAAAAAAAAC - 3 ' 



25 



(3) 



5 ' -AAACTGGATCCAGAACAATCTAACGCTTTCTCG-3 ' 



(atSRp30) and 



(4) 



5 ' - ATAT A G GAT C C T C AAC CAGAUAU C AC AGGT G- 3 ' 



(atSRp30); and 



(5) 



5 ' -AAATATCTAGAGATCTCAAATCGACGACC-3 ' 



30 



(atSRp34/SRl) and 
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( 6 ) 5 ' -ATATA GGATCC CATTTTACCTCGATGGAC- 3 ' 
(atSRp34/SRl) . 

All products were sequenced. Alternative splicing of 
the long introns was studied using primers derived from 
adjacent exons: 

( 7 ) 5 ' -AAT GAGCTCTG TGTCACCTGCTAGATCC- 3 ' 
(atSRp30) and 

( 8 ) 5 ' -AT AT A G GAT C C AG AT AT C AC AG G T G AAAC - 3 1 
(atSRp30) ; and 

( 9 ) 5 ' -ATAGGATCCAGGAGCAGAAGTCCCAAGGCAAAG- 3 ' 
(atSRp34/SRl) and 

( 10 ) 5 ' - AAAGTCGAC AGAAGGT AGAGGAGAT CT T GAT C - 3 ' 
(atSRp34/SRl) . 

The fragments were analysed on a 1.2% agarose gel. 

Constructs for promoter analysis and overexpression 

Approximately 1 kb of promoter sequences of atSRp30 
and atSRp34/SRl plus their complete 5' untranslated regions 
(including the first intron in the case of atSRp34/SRl) 
were obtained by PCR from genomic clone GatSRp30 and 
genomic DNA of A. thaliana, respectively, as a template, 
using primers 

(11) 5 ' -AAACTAAGCTTGGTATCTTCTTCCCTGCAAG-3 ' 
(atSRp30) ; 

( 12 ) 5 ' -AAACCTAGGCGGCTACTCAGCTGATACCTCAGAGCAG-3 ' 
(atSRp30) ; 

( 13 ) 5 ' -AAACTAAGCTTAAATATTGAACCGGCCTCGGTTC-3 ' 
(atSRp34/SRl) ; 

( 14 ) 5 ' -AAACTGGATCCTCTTCCTGTTGGTCGTCGACGATTTG-3 ' 
(atSRp34/SRl) ; and 

( 15 ) 5 ' -AAACTGGATCCTCTTCCTTTATCAAATCC-3 ' 
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(atSRp34/SRl) 

containing Hindlll and BamHI restriction sites. These 
fragments were digested with Hindlll and BamHI and fused to 
the GUS reporger gene (Jefferson 1987) in pBHOl 
5 (Clontech) . 

The cDNA and the genomic sequences of atSRp30 were 
amplified from cDNA and genomic DNA from A. thaliana using 
primers 3 and 1, containing BamHI and Sad restriction 
sites, respectively. The DNA fragments were sequenced and 

10 inserted into the BamHI-SacI restriction sites of the 

pBI121 (Clontech) vector (with the GUS gene deleted) under 
the control of the 35S CaMV promoter, giving rise to 
constructs named pC30 and pG30, respectively. 

Cultivation of plants and suspension cultures and 

15 plant transformation procedure 

Conditions for maintaining carrot and tobacco BY2 
suspension cultures were as described earlier (Lopato et 
al. 1996a). The suspension culture of A. thaliana ecotype 
Columbia was a gift from Czaba Koncz (Max-Planck-Institut 

20 fur Ziichtungsforschung, Koln, Germany) and was maintained 
as described in Lopato et al. (1999) . A. thaliana ecotype 
Columbia was used in all transfer experiments. Seeds were 
surface-sterilized by a 30 min imbibition with water, 
followed by a 1 min incubation in 70% ethanol, 10 min in 

25 10% bleach and five rinses with sterile water. Sterilized 
seeds were grown in MS medium (Murashige and Skoog 1962), 
supplemented with 1% sucrose. Plants were usually 
maintained in 16 h light/8 h dark cycle at 23°C, unless 
otherwise indicated. The cell culture was grown in medium 

30 containing Murashige and Skoog salts, 2x Gamborg's vitamins 
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(Gamborg et al. 1968), 1 mg/liter 2 , 4-dichlorophenoxy- 
acetic acid (2,4-D), and 3% (wt/vol) sucrose. Cell cultures 
were incubated in 50 ml of medium in 250 ml conical flasks 
on a rotary shaker at 110 rpm at 23°C in dim light. Cell 
suspensions were subcultured every 7 days and were diluted 
threefold with fresh medium at each subculture. 

All constructs were introduced into Agrobacterium 
tumefaciens LBA4404 (Hoekema et al. 1983), using a 
triparental mating procedure with the helper plasmid 
pRK2013 (Ditta et al . 1980). Root explants of A. thaliana 
were transformed using the method described by Valvekens et 
al. (1988) with one modification: Root explants were taken 
from 15-day-old seedlings grown on vertically placed agar 
plates (Huang and Ma 1992) . Seeds of transgenic plants were 
germinated in the presence of 50 ug/ml of kanamycin. 

Histochemical assay of GU5 expression 

Histochemical assays of GUS expression were carried 
out with 5-bromo-4-chloro-3-indolyl glucuronide (X-Gluc, 
Duchefa) as a substrate and were performed on intact 
seedlings or excised organs of mature plants grown in vivo 
as described by Jefferson (1987) . The samples were treated 
with 70% ethanol for 2-6 h to remove chlorophyll from 
tissues where necessary. 

Purification of SR proteins from Arabidopsis and 
immunoblotting 

SR proteins were purified from 3-week-old Arabidopsis 
plants or 5-day-old suspension cultures using a two-step 
salt precipitation method as described (Lopato et al. 
1996a) . Total protein extracts were prepared with the 
buffer for SR protein isolation (Zahler et al. 1992). 
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Proteins from the magnesium precipitate and total protein 
extracts were separated on a 12.5% SDS gel. Protocols for 
immunoblotting and detection have been described earlier 

(Lopato et al . 1999). Protein dephosphorylat ion was carried 
out with alkaline phosphatase from calf intestine 

(Biolabs) . 

Expression of atSRp30 and atSRp34/SRl in bacteria 
The coding region of atSRp30 cDNA was amplified by PCR 
using the primers 5 ' -ATATA CCATGG GTAGCCGATGGAATCGTAC-3 ' and 
(4) . 

The coding region of atSRp34/SRl cDNA was amplified by 
PCR using the primers 5 ' -ATATA CCATGG GCAGTCGT-TCGAG-3 ' and 

(6). The primers contain Ncol and BamHI restriction sites. 
To obtain the Ncol restriction site, the fourth nucleotide 
of the coding region in both cases was changed to G. Thus 
expressed atSRp30 and atSRp34/SRl have Ser2-Arg and Ser2- 
Gly substitutions, respectively. The fragments were cloned 
into the bacterial expression vector pET-3d (Novagen) and 
transformed into the E. coli strain BL2 1 ( DE3 ) pLysS 

(Novagen) . 

Single colonies were grown in 100 ml of LB medium to a 

density of 0.4-0.5 OD 550 , transferred to 900 ml of fresh 
pre-warmed medium and incubated for 1 h. The cultures were 
induced with 0.4 mM IPTG and were grown for a further 5 h. 
The bacterial pellet was washed twice in 500 ml of ice-cold 
washing buffer (100 mM NaCl, 10 mM Tris-HCl at pH 7.5) and 
used for inclusion body isolation: 2 grams of bacterial 
pellet was resuspended in 18 ml of ice-cold buffer A (10 mM 
Tris-HCl at pH 7.9, 100 mM KC1, 2 mM DTT, 35% sucrose). 4.5 
ml of ice-cold buffer B (333 mM Tris-HCl at pH 8.0, 100 mM 
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EDTA at pH 8.0, 40 mg of lysozyme) was added and incubated 
on ice for at least 10 min. Lysis buffer (22.5 ml, 1 M 
LiCl, 20 mM EDTA, 0.5% NP-40) was added, and the solution 
was sonicated for a total of 2 min at full power (15 sec 
5 bursts with cooling periods of 1 min in between) . Following 
centrifugation for 10 min (Sorvall, SS34, 13000 rpm) , the 
pellet was washed twice in 25 ml of buffer C (10 mM Tris- 
HC1 at pH 8.0, 0.1 mM EDTA, 0.5 M LiCl, 0.05% NP-40, 1 mM 
DTT) , and then twice with the same buffer lacking LiCl. The 
10 isolated inclusion bodies contained >95% pure protein and 
were used for polyclonal antibody preparation (Lopato et 
al. 1999) . 

RESULTS 

Isolation and sequencing of genomic and cDNA clones of 
15 atSRp30 

Initially, plant SR proteins in magnesium chloride 
pellets from protein extracts of carrot and tobacco cell 
cultures and from Arabidopsis plants were identified. The 
proteins were detected with the monoclonal antibody mAbl04, 

20 specific for a shared phosphoepitope in all SR proteins, 
and with a monoclonal antibody specific for the human 
SF2/ASF splicing factor (Lopato et al . 1996a) . A protein of 
approximately 50 kD from the carrot SR preparation, which 
was recognized by both antibodies, was isolated and 

25 partially sequenced. Two peptides with significant homology 
to human SF2/ASF and atSRp34/SRl were found. The first 
peptide comprised the RNP-2 submotif of the first RRM, and 
the second peptide had high homology to a sequence situated 
between RNP-2 and RNP-1 of the second RRM (see Materials 

30 and methods; Fig. 2, above). Based on these sequences, 
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degenerate primers were synthesized and used for PCR on 
purified genomic DNA from carrot, tobacco, and Arabidopsis. 
Sequencing of the cloned PCR products revealed two very 
homologous sequences from Arabidopsis and tobacco. The 
5 Arabidopsis fragment was 838 bp long and contained five 
introns. The fragment borders of the Arabidopsis fragment 
are marked with black arrows in Fig. 1A. The protein 
sequence derived from the PCR sequence had extensive 
homology to both human SF2/ASF and atSRp34/SRl (Fig. 2). 

10 The PCR product from Arabidopsis was used as a probe to 
screen a X ZAPII genomic library of A. thaliana. One 
genomic clone was found and designated GatSRp30 (genomic 
clone of Arabidopsis thaliana serine/arginine-rich protein 
with derived molecular mass of approximately 30 kD) . It was 

15 >4.5 kb long, had 1805 bp of promoter region, but ended 

12 bp upstream of the stop codon . A corresponding cDNA was 
obtained from an expressed sequence tag (EST) cDNA library 
of A. thaliana ecotype Columbia and sequenced. The mRNA 
corresponding to this cDNA was designated mRNAl of atSRp30. 

20 Primers derived from the 3' untranslated region of this 
cDNA were used to obtain the missing sequences of the 
genomic clone by PCR amplification on purified genomic DNA. 
The DNA fragment contained an additional intron, and the 
rest was identical to the corresponding sequence of the 

25 cDNAl . The sequence of GatSRp30 and the derived protein 
sequence are shown in Fig. 1A. 

To facilitate comparison of features of atSRp30 and of 
the closely related Arabidopsis atSRp34/SRl protein (Lazar 
et al. 1995), the genomic sequence of this protein was 

30 determined using PCR. GatSRp34 has a very similar gene 
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structure to that of GatSRp30, except for an intron present 
in the 5' untranslated region of the former. The fragment 
of GatSRp34 comprising the part from exon 11 to the stop 
codon is shown in Fig. 1C, where the alternative splicing 
5 events involving intron 11 are also indicated. 

Comparison of atSRp30 to other SR proteins 
The derived protein seguences of atSRp30 and 
atSRp34/SRl are very homologous (80.7% similarity and 67.1% 
identity) to each other, and both show very high similarity 

10 (75.3 and 77.8%, respectively) and identity (58.1 and 

59.4%, respectively) to human SF2/ASF (Fig. 2). As atSRp30 
and atSRp34/SRl are less homologous to other animal or 
plant SR proteins identified to date, both proteins can be 
considered true orthologs of human SF2/ASF. Within their 

15 amino-terminal portions, all three proteins contain two 
RRMs with their conserved RNP-2 and RNP-1 submotifs, 
whereby the second RRM is atypical and contains the 
invariant SWQDLKD signature characteristic of SF2/ASF-like 
SR proteins (Birney et al. 1993). However, unlike 

20 atSRp34/SRl and SF2/ASF, the RRMs of atSRp30 are not 

separated by a glycine-rich 'hinge' region but rather by a 
serine-rich seguence . The RS domain of atSRp30 is shorter 
than the one of atSRp34 because of an extension at the 3' 
end of atSRp34 which includes a previously described 

25 positively charged proline/serine/lysine-rich (PSK) domain 
of unknown function (Lazar et al. 1995). If this unigue 3' 
extension is not taken into account, atSRp34/SRl is 
slightly more homologous to human SF2/ASF, mainly because 
of their common G-rich hinge region. Taken together, these 

30 analyses suggest that in contrast to mammals, for which 
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only one SF2/ASF protein has been described to date, two 
SF2/ASF homologs exist in Arabidopsis. 

RNA distribution and alternative splicing forms of 
atSRp30 and atSRp34/SRl 

5 RNA blots of poly (A) + mRNA from various tissues of 

wild-type Arabidopsis plants were probed with radioactive 
labeled atSRp30 or atSRp34/SRl cDNAs and revealed at least 
two mRNA species in each case (Fig. 3A) . The level of 
expression of each gene varied considerably in different 

10 tissues but in both cases was highest in flowers, followed 
by roots (Fig. 3A) . In addition, the ratio of the two 
discernible mRNAs, mRNA3 and mRNAl, was different for each 
gene in the various organs. For atSRp30, mRNA 3 seemed to be 
more abundant in leaves, stems and flowers, whereas mRNAl 

15 was predominant in roots. In contrast, the ratio of the two 
main mRNAs of atSRp34/SRl was approximately 1:1 in leaves 
and stems, whereas in roots and flowers mRNAl predominated. 
Reprobing of the RNA blot with the long tenth intron of 
atSRp30 showed that mRNA3 retained sequences of this 

20 intron, indicating that alternative splicing involves this 
region of atSRp30 pre-mRNA. 

The full sequences of the various mRNA isoforms were 
obtained by RT-PCR cloning from total RNA using primers 
from the 5' and 3' ends of the genes (see Materials and 

25 Methods) . mRNAl corresponded to the cDNA sequence of 

atSRp30, whereas mRNA 3 retained part of the 10 th intron, 
because of usage of an alternative 3' splice site (Fig. 1A, 
B) . The mRNA 2 isoform with both alternative 3' and 5' 
splice sites within intron 10 (Fig. 1A, B) was not visible 
30 in Northerns of wild-type plants (see Fig. 3A) and was only 
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detected by RT-PCR by increasing the number of cycles (data 
not shown) . 

The same approach was used to sequence the mRNA 
isoforms of atSRp34/SRl (Fig. 1C) . The longest mRNA 
5 species, which is also the most abundant in wild-type 
plants, was named mRNA 5 and retained most of intron 11 
(which is in the position corresponding to intron 10 of 
atSRp30) except for an internal intron of approximately 8 0 
nucleotides. Two other minor alternative mRNAs could be 

10 detected during early plant development, but only by RT-PCR 
(data not shown). mRNA4 uses an additional alternative 3' 
splice site; mRNA2 uses alternative 5' and 3' splice sites 
(Fig. 1C, D) ; and mRNA3, which is the main alternatively 
spliced mRNA in plants when atSRp30 is overexpressed from 

15 the 35SCaMV promoter (Fig. 5, below) uses one alternative 
5' splice site. 

The above alternatively spliced mRNAs of atSRp30 and 
atSRp34/SRl all had an in-frame stop codon near the 5' end 
of the retained intronic sequences. The hypothetical 

20 shorter proteins, designated atSRp30s and atSRp34s, lack 

the carboxy-terminal part of the RS domain and instead have 
other sequences that are shown in bold in Fig. 1A for 
atSRp30s and in Fig. 1C for atSRp34s. 

To obtain information about changes in the 

25 transcriptional and alternative splicing patterns of 

atSRp30 during plant development, total RNA was isolated 
from whole plants of different ages and used for RNA blot 
hybridization and RT-PCR analysis (Fig. 3B, C) . Primers 

derived from exons adjacent to the 10 th intron were used 
30 for RT-PCR. The results from both methods were in good 
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agreement and showed that the expression of mRNAl is 
highest in younger plants and starts to decline around day 
12, whereas the expression of mRNA3 is extremely low in 
young seedlings, peaks between days 9-14, and declines 
5 slowly afterwards. Although it is not known how the ratio 
of these two transcripts is regulated, these regulatory 
events might determine the quantity of genuine atSRp30 
protein in individual cells. 

Antibdoy detection of Arabidopsis SR proteins 

10 Polyclonal antibodies were raised in chickens against 

purified recombinant atSRp30 and atSRp34/SRl and used to 
identify antigens in different ammonium sulfate fractions 
of Arabidopsis extracts. The extracts were fractionated by 
sequential precipitation with ammonium sulfate and 

15 magnesium chloride, to enrich for SR proteins (Roth et al . 
1990) . The anti-p30 and anti-p34 antibodies recognized 
proteins in the 60%-90% ammonium sulfate part. The 
immunoreactive proteins in the dialyzed 60%-90% part 
precipitated quantitatively in the presence of 20 mM 

20 magnesium chloride. No crossreaction of plant proteins with 
the pre-immune immunoglobulin fraction was found (data not 
shown) . 

The magnesium precipitate was immunoblotted and probed 
with four different antibodies. With anti-p30, six protein 

25 bands were detected as three doublets, with the 43- to 46- 
kD doublet showing the highest intensity. The two other 
doublets migrated with apparent molecular masses of 38-40 
and 31-34 kD (Fig. 4A, lane 2) . The immunostaining pattern 
with the monoclonal antibody mAbl04 (Fig. 4A, lane 1) , 

30 which is specific for a serine phosphoepitope common to SR 
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proteins (Roth et al. 1990) was very similar, although the 
smallest band was more pronounced than with anti-p30 (lane 
2) . This could be indicative of an additional SR protein of 
approximately 30 kD, or it may reflect the presence of 
multiple or stronger phosphoepitopes within p30. No high 
molecular mass proteins were detected with the mAbl04 
antibody, although Lazar et al. (1995) have reported 
immunoreactive proteins up to 120 kD. 

The anti-p34 antibodies recognize mainly three 
proteins of approximately 46-47, 40 and 34 kD. As the same 
antibody shows minimal cross-reaction with recombinant 
atSRp30 (data not shown) , these bands possibly represent 
atSRp34/SRl-related proteins and correlate well with 
published overexpression data in which the protein was 
immunodetected with a size of 47-48 kD (Lazar et al. 1995). 
The 40-kD protein comigrates with an alternatively spliced 
isoform of atSRp34/SRl (see below), whereas the nature of 
the 34-kD polypeptide remains to be determined. 
Interestingly, the immunoblot with anti-p34 is very similar 
to that obtained with a monoclonal antibody specific for 
human SF2/ASF (Fig. 4A, cf . lanes 3 and 4) . Therefore, 
anti-hSF2, which recognizes a discontinuous epitope within 
RRM1 of hSF2/ASF but does not recognize other human SR 
proteins (Hanamura et al . 1998) cross-reacts with the same 
set of Arabidopsis proteins as anti-p34, supporting the 
notion that hSF2/ASF resembles atSRp34/SRl in this region 
more closely than it does atSRp30 . 

The fact that atSRp30 and atSRp34/SRl are present in 
the SR protein preparation and comigrate with specific 
bands in the mAbl04 immunoblot suggests that both proteins 
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are phosphorylated. To test this suggestion, the SR protein 
preparation was treated with increasing concentrations of 
alkaline phosphatase and for different times (Fig. 4B, C) . 
Compared to the untreated controls (lanes 1, 2) , these 

5 treatments resulted in an increase in the mobility of all 
immunostained bands. Under these conditions, anti-p30 
recognized predominantly three proteins (27, 31, and 
38 kD) , whereas anti-p34 recognized two proteins (31 and 
38 kD) . As expected, all these proteins were no longer 

10 stained with mAbl04, except for the 38-kD band, which 
probably represents a partially dephosphorylated 
intermediate resistant to further dephosphorylation (data 
not shown) . Interestingly, dephosphorylation led to the 
appearance of new bands of higher apparent molecular mass, 

15 consistent with the observation that unphosphorylated 

recombinant atSRp30 and atSRp34/SRl are very insoluble and 
have a strong tendency to aggregate. 

Transcriptional activity of SR protein promoters in 
Arabidopsis 

20 It is striking that in contrast to mammals, 

Arabidopsis possesses two SF2/ASF-like proteins. As 
revealed by the Northern blot analysis, the genes of both 
proteins are transcribed in all plant organs. It was 
therefore of interest to investigate if both genes are 

25 generally transcribed in all tissues or specifically in 

different tissues, or to which extent they have overlapping 
activities. To this end, the promoters of GatSRp30 and 
GatSRp34, including the 5' untranslated regions up to the 
start codon, were fused to the coding region of a reporter 

30 (3-glucuronidase gene and transferred into A. thaliana 
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plants by Agrobacterium tumef aciens-mediated root 
transformation. p-Glucuronidase (GUS) activity was never 
observed in any tested nontransgenic Arabidopsis tissues, 
as well as in transgenic plants harboring a promoterless 
5 GUS gene. Control plants containing a 35 S cauliflower 
mosaic virus (CaMV) promoter-Gus fusion were easily 
stained, demonstrating that substrate accessibility was not 
limiting GUS activity. To exclude an influence of the 
transgene position on promoter activity, the analyses were 

10 carried out with several independent transgenic lines. 

Using detailed histochemical analyses, it was found 
that both genes had distinct but also overlapping patterns 
of transcriptional activities. Flowers in the postanthesis 
stage (Fig. 5A, E) showed essentially the same staining 

15 patterns for both genes, with the most pronounced 

expression in the pollen grains and weak staining of sepals 
and style. Additionally, weak activity was observed in the 
abscission zone of the flower in atSRp30-GUS transgenic 
plants (Fig. 5A) . The leaves of atSRp30-GUS plants showed 

20 strongest staining in vascular bundles and in the support 
cells of each trichome (Fig. 5B) , whereas the atSRp34-GUS 
fusion was weakly active only in secondary veins and in the 
cells around them, but activity was never observed in 
trichomes or support cells (Fig. 5F) . The differences in 

25 the GUS staining patterns of these genes were especially 
evident during the different stages of lateral root 
development. Both atSRp30 and atSRp34-GUS fusions were 
active at the very first stages of lateral root initiation, 
when pericycle cells are stimulated, de-differentiate, and 

30 proliferate. Then, at the later stages, when 
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redif f erentiation occurs to form the lateral root meristem, 
atSRp30-GUS was expressed in the basal enlarged cells of 
the forming lateral root (Fig. 5C) , whereas atSRp34-GUS 
expression was characteristic of the activated root 
5 meristem (Fig. 5G) . The staining patterns during early 

seedling development provided additional evidence for the 
distinct transcriptional activities of atSRp30 and 
atSRp34/SRl. Each of the genes was expressed in specific 
regions along the apical-basal axis of the seedling. In 

10 atSRp30-GUS seedlings, expression was restricted to the 
cotyledons (Fig. 5D) , whereas in atSRp34-GUS seedlings, 
expression was observed in hypocotyl and in root (Fig. 5H) , 
suggesting that these genes can be transcriptionally active 
in regions with different patterns of cell division. This 

15 suggestion was confirmed by differential expression of 

atSRp30 and atSRp34/SRl during lateral root formation, as 
well as in trichome support cells. 

Overexpression of atSRp30 in transgenic plants 

In Drosophila, ectopic overexpression of SRp55/B52 

20 resulted in various developmental abnormalities, although 
the identity of affected transcripts remains unknown (Kraus 
and Lis 1994) . It was therefore of interest to study the 
effect of overexpression of atSRp30 on plant development 
and on the alternative splicing patterns of individual 

25 plant transcripts. The constructs used contained either a 
complete gene (pG30) or a cDNA (pC30) encoding atSRp30 
under the control of the strong constitutive promoter of 
the 35S RNA from CaMV. The 35S CaMV promoter is strong and 
constitutive in all plant tissues studied, which was 

30 confirmed in control experiments using GUS as a reporter 
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gene. To control for the conditions of the regeneration and 
transformation procedure in the RNA analysis of 
transf ormants , negative controls were either transformed 
with a 3 5 S RNA promoter-GUS control (pBI121) or were 
5 transf ormants with the same construct, but which for 
unknown reasons did not show overexpression . 

Both fusion constructs, pG30 and pC30, were used for 
Agrobacterium-mediated transformation of Arabidopsis roots. 
Forty independent transgenic lines were regenerated for 

10 each construct; eight of the lines transformed with pG30 

and twelve transformed with pC30 were used in further work. 
Some of these transgenic lines were used for RNA blot and 
RT-PCR analyses as shown in Fig. 6A and B. Surprisingly, 
all pG30 transf ormants (Fig. 6A, B, lanes 1-4) expressed 

15 mainly mRNA3, which possesses an alternative 3' splice site 
within the long intron (Fig. IB) , but nevertheless, mRNAl 
was still more abundant in these plants than in the control 
plant (Fig. 6A, lane 10) , as judged by the RNA blot 
analysis (four times more product was loaded in lane 10 in 

20 Fig. 6B) . Although mRNAl levels in both types of 

transf ormants are quite different (Fig. 6A) , the RT-PCR 
shows similar yields (Fig. 6B) because a high number of 
cycles was employed to detect all alternatively spliced 
products. As expected, all transf ormants containing pC30 

25 overexpressed only mRNAl (Fig. 6A, lanes 5-9) . In both 

types of transf ormants , overexpressed mRNAs were >100-fold 
more abundant than in wild-type plants. Interestingly, when 
pC30 transf ormants were analyzed by RT-PCR, only mRNA2 , 
which uses cryptic 3' and 5' splice sites in intron 10 

30 (Figs. 6B and IB), was present, whereas a small amount of 
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mRNA3 was visible in wild-type plants. This result shows 
that an excess of atSRp30 changes 5 ' -splice-site selection 
of the endogenous atSRp30 pre-mRNA. 

In light of the above results, it was of interest to 

5 measure overexpression of atSRp30 at the protein level. 
Total soluble protein extracts of transgenic plants were 
analyzed by Western blotting using anti-p30 for 
immunodetection (Fig. 6C) and compared to a control plant 
(lane 8, a transformed plant with no overexpression of 

10 atSRp30) and to an SR protein preparation from an 

Arabidopsis suspension culture (lane 1) . The SR protein 
preparation had the characteristic pattern for anti-p30 
(cf. Fig. 6C, lane 1 and Fig. 4A, lane 2), whereas no 
specific proteins were immunostained in the total protein 

15 extract of the control plant (Fig. 6C, lane 8), because of 
the low abundance of atSRp30 in plants. In contrast, a 
specific protein band was visible in the pC30 transf ormants 
(lanes 3-7), comigrating with a protein band in the SR 
protein preparation (lane 1) . In contrast, in the pG30 

20 transf ormant (lane 2) , only a faint protein band was 
detected. These results correlate well with the RNA 
expression pattern of mRNAl in both types of transf ormants 
(Fig. 6A) . It is interesting that overexpression of atSRp30 
yielded an immunostained protein of 38 kD and not a band 

25 comigrating with the more abundant 43- to 46-kD doublet 

seen in the SR protein preparation (Fig. 6C, lane 1) . One 
likely explanation is that overexpressed atSRp30 is only 
partially phosphorylated, which may be related to the 
observation that even exhaustive dephosphorylation results 

30 in a 38-kD atSRp30 protein that retains the mAbl04 
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phosphoepitope (Fig. 4B, lanes 3-6) . However, no shorter 
protein product (atSRp30s) of the highly abundant mRNA3 in 
pG30 transf ormants could be detected with any antibody used 
(see also Fig. 6C, lane 2) . 
5 Phenotypic changes in plants overexpressing atSRp30 

Overexpression of atSRp30 resulted in strong 
phenotypes with pleiotropic changes both in morphology and 
development of the transgenic plants. No significant 
differences were observed between plants transformed with 

10 pG30 and pC30 constructs, although the levels of atSRp30 

protein were different (Fig. 6C, cf . lane 2 and lanes 3-7) . 
The observations were reproducible in and subsequent 
generations of independent transgenic lines, and cosegrated 
with antibiotic resistance. In transgenic plants, the 

15 transition from vegetative to flowering stage was delayed 
greatly under short day conditions. The time from 
germination to the formation of the first mature silique 
was 65-78 days in overexpressing plants, compared to 42-47 
days in control plants grown under the same conditions. In 

20 addition, adult plants showed reduced apical dominance, 

resulting in a 'bushy' phenotype with an increased number 
of secondary inflorescences and a shortened primary 
inflorescence (Fig. 7C) . Growing of the transgenic plants 
under long-day conditions resulted in partial reversion to 

25 a normal phenotype. But despite of the partial recovery of 
apical dominance, the transition to flowering and the time 
to senescence remained delayed. Under long-day conditions, 
the majority of plants from overexpressing lines started to 
flower between 35 and 39 days after germination, whereas 

30 control plants flowered between days 25 and 28. The initial 
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inflorescences were very short and did not exceed 0.5-1 cm 
when the first flower opened. The size of flowers in 
transgenic plants was approximately 30% larger than in 
control plants (4 and 3 mm, respectively) . Transgenic 
5 plants also had larger rosette leaves (Fig. 7A, cf. to wild 
type in B) with trichomes having predominantly four to five 
branches, compared to wild-type leaves, which had mainly 
three-branched trichomes. In strong overexpressing lines, 
the primary inflorescence produced numerous secondary 

10 branches with a vegetative rosette-like appearance. These 
branches developed from 7-15 vegetative-like leaves and 
finally formed inflorescences. Under long-day conditions, 
the rosette leaf numbers determined at the time of 
flowering were somewhat lower in transgenic lines compared 

15 to the wild type (an average of 12 and 16 leaves, 

respectively) . Independent transgenic lines overexpressing 
atSRp30 displayed the described characteristics to various 
extents, with the main impact on the time for transition 
from the vegetative to the reproductive phase. 

20 Overexpressed atSRp30 modulates alternative splicing 

in vivo 

The influence of overexpression of atSRp30 on its own 
alternative splicing pattern and the high homology of 
atSRp30 to human alternative splicing factor SF2/ASF 

25 suggested that this protein could be a modulator of plant 
splicing. This assumption cannot be tested in vitro, as no 
plant splicing extracts are available. Therefore, total RNA 
preparations from pG30 transf ormants (lower level of 
atSRp30 overexpression) and pC30 transf ormants (higher 

30 level of atSRp30 overexpression) were used for RT-PCR 
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analysis of several plant introns that are known to be 
processed alternatively under wild-type conditions. 

Among several genes tested, such as FCA (Macknight et 
al. 1997), rubisco activase (Werneke et al . 1989), agamous 
5 (Yanofsky et al . 1990), and LSD1 (Dietrich et al. 1997), no 
changes in the splicing patterns were observed in plants 
overexpressing atSRp30. However, pre-mRNA splicing of three 
plant genes was found to be affected by overexpression, as 
described below. 

10 The splicing factor atRSp31 belongs to a novel family 

of plant RS proteins and possesses an alternatively spliced 
second intron (792 nucleotides) (Lopato et al. 1996b) . In 
the wild-type plant, either a cryptic 3' splice site or 
both alternative 3' and 5' splice sites were used in this 

15 intron (Fig. 8A) , whereas the latter form prevailed in 

control lines (lane 1) and pG30 transformed lines (lanes 2- 
5) . In contrast, pC30 lines (lanes 6-10) expressed mainly 
the alternative 3 r splice site form, although its abundance 
was variable (in lanes 6 and 7 this form is only visible in 

20 the original photograph) . These transcripts are probably 

subject to nonsense-mediated decay and potentially code for 
the same truncated protein. 

Another well-described alternative splicing event 
occurs in the case of the U1-70K gene of Arabidopsis, in 

25 which transcripts retaining intron 6 are more abundant in 
most tissues (except roots) than the correctly spliced 
transcript (Golovkin and Reddy 1996) . The RT-PCR results 
from control plants (Fig. 8B, lanes 1, 9, 10) confirm this 
observation. However, in both atSRp30 transf ormants , either 

30 the lower expressing pG30 lines (lanes 2 and 3) or the 
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higher expressing pC30 lines (lanes 4-8), most of the 
transcripts were spliced correctly and only a low level of 
intron retention was observed. This regulatory event could 
potentially influence the level of active protein. 
5 The most interesting case of alternative splicing 

modulation was found in the case of the long intron (intron 
11) of GatSRp34, which is a close homolog of GatSRp30 (Fig. 
9) . In wild-type plants (Fig. 9, lane 1) , part of this 
intron is alternatively spliced to generate mRNA5. However, 

10 in pG30 transf ormants (lanes 2-5), mRNA3 was the main 
alternative spliced form, whereas the level of mRNAl 
appeared unchanged. Surprisingly, when atSRp30 was highly 
overexpressed, the alternatively spliced mRNA3 became the 
main transcript, whereas the amount of mRNAl was 

15 considerably decreased. Assuming that shorter PCR products 
are probably overrepresented, the reduction of mRNAl may be 
even more drastic. This assumption was confirmed by 
immunoblot analysis of total protein extracts: atSRp34/SRl 
was deleted in control plants (Fig. 9B, lane 3) and in a 

20 pG30 transf ormant (lane 5) , but not in a pC30 transf ormant 
(lane 4) . However, a smaller specific protein band 
(atSRp34s, 38 kD) became visible in this case, indicating 
that the protein product of mRNA3 predominates in this 
transf ormant . These results clearly show that 

25 overexpression of atSRp30 strongly influences gene 
expression of its close homolog, atSRp34/SRl. 
DISCUSSION 

atSRp30 is a member of the SR protein family 
In addition to their characteristic domains, which 
30 include one or two RRMs and a carboxy-terminal RS domain 
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with multiple SR dipeptides, SR proteins possess 
phosphoepitope (s) recognized by mAbl04, and they are 
insoluble in the presence of millimolar concentrations of 
magnesium salts. The antibody to atSRp30 recognized a 
5 complex pattern of bands in the Mg precipitate, whereas in 
total protein fractions no specific band could be detected. 
The failure to detect atSRp30 in crude lysates reflects the 
low abundance of the protein and/or the limited sensitivity 
of the antibody. However, when atSRp30 was overexpressed, 

10 the antibody detected a specific protein with an apparent 
mobility of 40 kD. In contrast, atSRp34/SR could be 
detected by anti-p34 in total protein fractions. The 
immunoblot data confirm that atSRp30 is a true SR protein, 
as it is present in SR protein preparations and shows 

15 corresponding immunostained bands with mAbl04. However, as 
complete dephosphorylation could not be achieved, it is not 
known how many of these proteins represent modified forms 
of atSRp30 or closely related proteins. 

Is the expression of atSRp30 autoregulating? 

20 When atSRp30 pre-mRNA was overexpressed, mRNA3, which 

is generated from use of an alternative 3' splice site in 
the tenth intron, was the main transcript detected, whereas 
the level of mRNAl was enhanced only moderately (Fig. 6) . 
There are several possible explanations for this 

25 phenomenon. First, there may be a limiting splicing factor 
that is titrated out because of overexpression of the 
atSRp30 gene. Such a factor would have to be specific for 
the tenth intron, as all the other introns in atSRp30 are 
spliced properly. Second, mRNA 3 and mRNAl may be 

30 synthesized for the most part in different cells, via cell- 
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type-specific alternative splicing. As the constitutive 35S 
CaMV promoter used in the overexpression experiments is 
active in most tissues the large amount of mRNA3 may 
reflect inappropriate expression in cell types in which a 
5 factor required for correct processing of the tenth intron 
is absent. To clarify this issue, experiments aimed at 
determining the cells in which the alternative splicing 
event takes place will be necessary. 

The fact that the ratio of mRNA3 to mRNAl is constant 

10 in all transformants overexpressing GatSRp30 (Fig. 6A) is 
consistent with a potential autoregulatory mechanism 
involving atSRp30. Overproduced atSRp30 protein may 
stimulate the alternative splicing event, thus down- 
regulating its own expression, as has been shown to be the 

15 case for several animal splicing factors, such as Sxl (Bell 
et al. 1991), tra (Mattox and Baker 1991), and SWAP (Zachar 
et al. 1987) from Drosophila, as well as mouse SRp20 (Jumaa 
and Nielsen 1997). In agreement with this model, it was not 
possible to detect a protein product from mRNA3 (atSRp30s) . 

20 Evidence was obtained for an influence of atSRp30 on 

alternative splicing of transcripts from the endogenous 
gene in plants overexpressing the atSRp30 cDNA (Fig. 6B, 
lanes 5-9) , resulting in preferential use of an alternative 
5' splice site. Overexpression of atSRp30 might have a 

25 different effect on the large amount of exogenous pre-mRNA 
present in cells where this protein is not expressed 
normally resulting in preferential use of an alternative 3' 
splice site (Fig. 6A, B, lanes 1-4) . Unfortunately, this 
hypothesis cannot be tested in vitro because no plant 

30 splicing extracts are available. Future experiments should 
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address whether atSRp30 can indeed autoregulate its own 
production. 

atSRp30 overexpression influences splice-site choice 
in several plant pre-mRNAs and induces changes in plant 
5 development 

Many in vivo and in vitro experiments have shown that 
human SF2/ASF is involved in the selection of 5' splice 
sites by binding cooperatively to this site (Zuo and Manley 
1994) and to UlsnRNP (Kohtz et al. 1994; Jamison et al. 

10 1995/ Zahler and Roth 1995) . Furthermore, like other SR 

proteins, SF2/ASF influences splice-site selection in part 
by binding to enhancer sequences that are present 
frequently in intronic or exonic regions. Upon binding to 
those elements, the protein activates splicing by 

15 recruiting the splicing mechanism to an adjacent splice 

site through protein-protein interactions (Wu and Maniatis 
1993; Amrein et al. 1994; Zuo and 

Manley 1994) . Arabidopsis atSRp30 was a good candidate for 
being a splicing modulator, given its similarity to human 

20 SF2/ASF and its peculiar expression pattern. The ability to 
stably overexpress atSRp30 in whole plants that remained 
viable made it possible to determine, for the first time, 
the effects of increased SR protein levels on alternative 
splicing of specific endogenous transcripts. Some but not 

25 all endogenous transcripts tested were affected. In the 

case of atSRp30, pre-mRNA itself, and tSRp34 / SRl/pre-mRNA, 
intronic alternative 5' splice sites were activated, 
whereas in atRSp31 and Ul 70K pre-mRNAs, the usage of the 
normal splice sites was generally enhanced but variable. 

30 There is no explanation for this variability in individual 
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transformants (see Fig. 8), as the atSRp30 protein levels 
in several pC30 transformants were similar (Fig. 6C) . The 
most pronounced effect of atSRp30 overexpression was on the 
splicing pattern of the close homolog atSRp34/SRl, in which 
5 preferential use of the alternative 5' splice site gave 
rise mostly to a truncated mRNA isoform, atSRp34s, and 
virtually no atSRp34/SRl protein. Although there is no 
information to date concerning the activity of the smaller 
atSRp34s, these results suggest the normal expression of 

10 atSRp30 protein in the same cells that transcribe 

atSRp34/SRl is important for the synthesis of authentic 
atSRp34/SRl protein. This regulatory loop might explain the 
different expression patterns of both factors in root 
tissue and young seedlings. It remains to be determined 

15 whether there is a reciprocal effect of atSRp34/SRl 

overexpression on alternative splicing of atSRp30 pre-mRNA. 

Plants overexpressing atSRp30 const itutively and 
ubiquitously showed interesting changes in morphology, as 
well as in several aspects of development. Although some of 

20 the details will have to be investigated more thoroughly, 
many of the differences can be explained by changes in 
phase transition and in the definition of cell fates. In 
this work it has been demonstrated that atSRp30 can change 
the expression of other genes drastically by influencing 

25 their splicing patterns. As the expression pattern of 
atSRp30 is normally very tissue specific, some of the 
observed effects of overexpression might simply be caused 
by its expression in inappropriate tissues, where it 
influences the expression of genes that are not the natural 

30 targets of this splicing factor. In addition, the observed 
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decrease in the expression of the splicing factor atSRp34 
upon overexpressing atSRp30 may in turn affect the 
expression of other genes, leading to the observed 
phenotypic changes. On the other hand, the changes in 
5 trichome development resulting in additional branching may 
reflect overexpression of atSRp30 in the supporting cell of 
the trichomes, which is a normal site of atSRp30 
expression. Therefore, atSRp30 may be a determinant of 
trichome development. Finding the natural targets of 

10 regulation by atSRp30 remains a critical goal to explain 
the observed changes in phenotype, as well as to promote 
the understanding of developmental pathways in plants. 
Two SF2/ASF-like factors in Arabidopsis 
Immunoblot analysis with mAbl04 has suggested 

15 previously that the complexity of the SR protein family is 
higher in animals than in plants (Lopato et al . 1996a), as 
several higher molecular mass proteins have been identified 
in SR preparations from mammals but not from those in 
plants. The existence of two SF2/ASF-like factors in 

20 Arabidopsis may compensate for the absence of orthologs of 
other members of the SR protein family. The similar gene 
structure of atSRp30 and atSRp34/SRl is indicative of an 
ancestral gene duplication event. Interestingly, the 
penultimate long intron is conserved in both genes and is 

25 involved in alternative splicing events. Judging from 

Northern blot analysis of RNAs from different plant organs, 
the ratio of the various SR protein mRNAs is quite variable 
and is probably reflected in the abundance of the resulting 
proteins. In both genes, the alternative splicing event 

30 results in predicted proteins with a truncated RS region 
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and a few new additional carboxy-terminal amino acids. 
Whereas a shorter protein of atSRp34/SRl could be detected 
clearly in plants overexpressing atSRp30, atSRp30s was 
never observed in these plants, although the corresponding 
5 alternatively spliced mRNA is abundant. Expression of 

atSRp30s cDNA in Escherichia coli yielded stable protein 
(data not shown) . Therefore, either mRNA3 is not translated 
efficiently, or it is degraded through a premature nonsense 
codon-mediated decay pathway, or atSRp30s is preferentially 

10 recognized by one of the plant protein-degradation systems. 

Comparative analysis of atSRp30 with all known plant 
and animal SR proteins revealed that it is a paralog of 
atSRp34/SRl and that both Arabidopsis proteins are closest 
to human SF2/ASF. The main structural difference is that 

15 atSRp30 lacks a G-rich region between the two RRMs, which 
may influence the flexibility and perhaps the specificity 
of the RNA-binding region. The notion that the two proteins 
might have distinct activities is strengthened by the 
observation that in many cases their expression patterns 

20 are guite different, as shown in Fig. 5. In general, 

atSRp30 expression is more confined to specialized cell 
types and tissues, like trichomes, cotyledons, or lateral 
root primordia, suggesting a special role for this protein 
in the initiation of organ formation, whereas atSRp34/SRl 

25 is more strongly expressed in meristematic tissue. 

Furthermore, the present sequence and immunological data 
indicate that atSRp34/SRl is more similar to hSF2/ASF than 
is atSRp30. Taken together, these data suggest that 
atSRp34/SRl is a more general splicing factor, similar to 

30 hSF2/ASF, whereas atSRp30 might have more specialized 
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functions, perhaps acting as a regulatory splicing factor 
that modulates alternative splicing and gene expression in 
specific cell types. 
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